Using Machine Learning to Identify Intonational Segments

نویسندگان

  • Julia Hirschberg
  • Christine H. Nakatani
چکیده

The intonational phrase is hypothesized to represent a meaningful unit of analysis in spoken language interpretation. We present results on the identification of intonational phrase boundaries from acoustic features using classification and regression trees (CART). Our training and test data are taken from the Boston Directions Corpus (task-oriented monologue) and the HUB-IV Broadcast News database (monologue and multi-party). Our goal is two-fold: (1) to provide intonational phrase segmentation as a front end for an ASR engine, and (2) to infer topic structure from acoustic-prosodic features. These efforts are aimed at improving the ease and flexibility of retrieving and browsing speech documents from a large audio database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Intelligent application for Heart disease detection using Hybrid Optimization algorithm

Prediction of heart disease is very important because it is one of the causes of death around the world. Moreover, heart disease prediction in the early stage plays a main role in the treatment and recovery disease and reduces costs of diagnosis disease and side effects it. Machine learning algorithms are able to identify an effective pattern for diagnosis and treatment of the disease and ident...

متن کامل

Acoustic Classification of Focus: On the Web and in the Lab

We present a new methodological approach which combines both naturally-occurring speech “harvested” on the web and speech data elicited in the laboratory. This proof-of-concept study examines the phenomenon of focus sensitivity in English, in which the interpretation of particular grammatical constructions (e.g. the comparative) is sensitive to the location of prosodic prominence. Machine learn...

متن کامل

An analysis of prosodic information for the recognition of dialogue acts in a multimodal corpus in Mexican Spanish

This paper presents empirical results of an analysis on the role of prosody in the recognition of dialogue acts and utterance mood in a practical dialogue corpus in Mexican Spanish. The work is configured as a series of machine-learning experimental conditions in which models are created by using intonational and other data as predictors and dialogue act tagging data as targets. We show that ut...

متن کامل

Automatically Derived Discourse Segmentation Algorithms Based on Acoustic-Prosodic Features

We describe an investigation aimed at furthering the understanding of how speakers communicate discourse structural information using intonational features. We used the read and spontaneous speech of two speakers from the Boston Directions Corpus (BDC) to automatically identify elements of discourse structure based on intonational features. Unlike previous acoustic-prosodic analyses of discours...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002